Audiovisual Speech Coder : Using Vector Quantization To Exploit The Audio/Video Correlation

نویسندگان

  • Elodie Foucher
  • Laurent Girin
  • Gang Feng
چکیده

Visual information can help listeners to better understand what is said. In the speech coding domain, it will be shown that it allows to reduce the transmission rate of a classic vocoder (1,9 kbit/s instead of 2,4 kbit/s) by estimating audio parameters from video ones. In addition, vector quantization seems to be a good method to reduce the redundancy between some audio and visual coefficients. With the vector quantization, we can reduce again the bit rate while decreasing the quantization error.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Vector quantization of glottal pulses

An efficient codebook driven voiced excitation coding method producing natural sounding speech is proposed. It can be incorporated as an essential part in a complete speech coder working at low bit rates. The inter-pulse correlation of such a coding scheme is investigated and exploited using linear predictive vector quantization and finite state vector quantization (FSVQ). A new and more robust...

متن کامل

Using Dynamic Codebook Re-ordering to E MELP Code

Model based speech coders such as the mixed–excitation linear prediction (MELP) coder encode parameters of the autoregressive model for short-duration frames of the speech signal. Typically, parameters extracted from successive frames by the MELP coder exhibit strong correlation. Reduction in the transmitted data-rates can be achieved if the encoders for these parameters effectively exploit thi...

متن کامل

Video compression using multiwavelet and multistage vector quantization

This paper presents a new video coding technique using multiwavelet transform and multi-stage vector quantization. Three types of redundancies that are common in video sequence are spatial, temporal and psycho visual redundancies. In this work, the spatial redundancy in the video is minimized using multiwavelet transform. The transform coefficients are then quantized using multi-stage vector qu...

متن کامل

A new audio coding scheme using a forward masking model and perceptually weighted vector quantization

This paper presents a new audio coder that includes two techniques to improve the sound quality of the audio coding system. First, a forward masking model is proposed. This model exploits adaptation of the peripheral sensory and neural elements in the auditory system, which is often deemed as the cause of forward masking. In the proposed audio coder, the forward masking is first modeled by a no...

متن کامل

A design of transform coder for both speech and audio signals at 1 bit/sample

This paper proposes a speech and audio coder which operates at 1 bit/sample, namely an 8 kbit/s coder for 8 kHz sampling or a 16 kbit/s coder for 16 kHz sampling. The basic structure is inherited from a TwinVQ (Transform domain Weighted Interleave Vector Quantization) high-quality audio coding scheme. Periodical component extraction scheme is newly added to the quantization of MDCT coe cients. ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1998